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In the claims: 

Please amend the claims as follows: 

1. (Original) A method for separating two or more 
subsets of polypeptides within a set of polypeptides, 
comprising: 

(a) determining a sequence comparison signature 
for each amino acid sequence in a set of amino acid 
sequences, wherein said sequence comparison signature 
comprises pairwise comparison scores for said amino acid 
sequence compared to each of the other amino acid sequences 
in said set; 

(b) constructing a distance arrangement 
comprising said sequence comparison signatures related 
according to the distance between each of said sequence 
comparison signatures; and 

(c) identifying a first and second cluster of 
sequence comparison signatures in the distance arrangement, 
wherein said first cluster comprises sequence comparison 
signatures for polypeptides having a similar protein fold 
or biological function, said protein fold or function being 
different compared to a protein fold or function of 
polypeptides having sequence comparison signatures in said 
second cluster. 

2. (Original) The method of claim 1, wherein said 
pairwise comparison score is determined by an algorithm 
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selected from the group consisting of Smith-Waterman, 
BLAST, FASTA, Needleman-Wunsch, Seller or PSI-BLAST. 

3. (Original) The method of claim 1, wherein said 
distance comprises a distance selected from the group 
consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

4. (Original) The method of claim 1, wherein said 
distance comprises the distance between a sequence 
comparison signature and a set of sequence comparison 
signatures . 

5. (Original) The method of claim 1, wherein said 
distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

6. (Original) The method of claim 1, wherein said 
cluster of sequence comparison signatures is identified by 
hierarchical clustering. 

7. (Original) The method of claim 6, wherein said 
hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 
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8. (Original) The method of claim 1, wherein said 
cluster of sequence comparison signatures is identified by 
non-hierarchical clustering. 

9. (Original) The method of claim 8, wherein said 
non-hierarchical clustering comprises Jarvis-Patrick 
clustering. 

10. (Original) The method of claim 1, wherein 
said cluster of sequence comparison signatures is 
identified by cell-based clustering. 

11. (Original) The method of claim 1, wherein 
said subset of polypeptides comprises a family of proteins 
having a common structural fold. 

12. (Original) The method of claim 1, wherein 
said subset of polypeptides comprises a family of proteins 
having a common function. 

13. (Original) A method for identifying a member 
of a polypeptide family, comprising: 

(a) determining a query sequence comparison 
signature for an amino acid sequence, wherein said query 
sequence comparison signature comprises pairwise comparison 
scores for said amino acid sequence compared to each amino 
acid sequence in a set; 
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(b) comparing the distance between said query- 
sequence comparison signature and the sequence comparison 
signatures for other amino acid sequences in said set, 
wherein said sequence comparison signatures for other amino 
acid sequences in said set are clustered into polypeptide 
families; and 

(c) identifying a proximal cluster having one or 
more sequence comparison signature that has a closer 
distance to said query sequence comparison signature than 
the sequence comparison signatures of a distal cluster, 
thereby identifying the polypeptide having said query 
sequence comparison signature as being a member of the 
polypeptide family for the proximal cluster. 

14. (Original) The method of claim 13, wherein 
said pairwise comparison score is determined by an 
algorithm selected from the group consisting of Smith- 
Waterman, BLAST, FASTA, " Needleman-Wunsch, Seller or PSI- 
BLAST. 

15. (Original) The method of claim 13, wherein 
said distance comprises a distance selected from the group 
consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

16. (Original) The method of claim 13, wherein 
said distance comprises the distance between a sequence 
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comparison signature and a set of sequence comparison 
signatures . 

17. (Original) The method of claim 13, wherein 
said distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

18. (Original) The method of claim 13, wherein 
said cluster of sequence comparison signatures is 
identified by hierarchical clustering. 

19. (Original) The method of claim 18, wherein 
said hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 

20. (Original) The method of claim 13, wherein 
said cluster of sequence comparison signatures is 
identified by non-hierarchical clustering. 

21. (Amended) The method of claim [83] 20, 
wherein said non-hierarchical clustering comprises Jarvis- 
Patrick clustering. 

22. (Original) The method of claim 13, wherein 
said cluster of sequence comparison signatures is 
identified by cell-based clustering. 
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23. (Original) The method of claim 13, wherein 
said polypeptide family comprises polypeptides having a 
common structural fold. 

24. (Original) The method of claim 13, wherein 
said polypeptide family comprises polypeptides having a 
common function. 

25. (Original) A method for identifying a 
polypeptide pharmacof amily, comprising: 

(a) determining a sequence comparison signature 
for each amino acid sequence in a set of amino acid 
sequences, wherein said sequence comparison signature 
comprises pairwise comparison scores for said amino acid 
sequence compared to each of the other amino acid sequences 
in said set; 

(b) constructing a distance arrangement 
comprising said sequence comparison signatures related 
according to the distance between each of said sequence 
comparison signatures; and 

(c) identifying separate clusters of sequence 
comparison signatures in said distance arrangement, wherein 
said separate clusters comprise sequence comparison 
signatures for sequences in the same ligand binding family 
and separate pharmacof amilies . 

26. (Original) The method of claim 25, wherein 
said pairwise comparison score is determined by an 
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algorithm selected from the group consisting of Smith- 
Waterman, BLAST, FASTA, Needleman-Wunsch, Seller or PSI- 
BLAST. 

27. (Original) The method of claim 25, wherein 
said distance comprises a distance selected from the group 
consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

28. (Original) The method of claim 25, wherein 
said distance comprises the distance between a sequence 
comparison signature and a set of sequence comparison 
signatures . 

29. (Original) The method of claim 25, wherein 
said distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

30. (Original) The method of claim 25, wherein 
said cluster of sequence comparison signatures is 
identified by hierarchical clustering. 

31. (Original) The method of claim 30, wherein 
said hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 
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32. (Original) The method of claim 25, wherein 
said cluster of sequence comparison signatures is 
identified by non-hierarchical clustering. 

33. (Original) The method of claim 32, wherein 
said non-hierarchical clustering comprises Jarvis-Patrick 
clustering. 

34. (Original) The method of claim 25, wherein 
said cluster of sequence comparison signatures is 
identified by cell-based clustering. 

35. (Original) The method of claim 25, wherein 
said ligand comprises a nicotinamide adenine dinucleotide- 
related molecule. 

36. (Original) The method of claim 35, wherein 
said nicotinamide adenine dinucleotide-related molecule is 
selected from the group consisting of oxidized nicotinamide 
adenine dinucleotide, reduced nicotinamide adenine 
dinucleotide, oxidized nicotinamide adenine dinucleotide 
phosphate, reduced nicotinamide adenine dinucleotide 
phosphate, and a mimetic thereof. 

37. (Original) A method for identifying a member 
of a pharmacof amily, comprising: 

(a) determining a query sequence comparison 
signature for an amino acid sequence, wherein said query 
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sequence comparison signature comprises pairwise comparison 
scores for said amino acid sequence compared to each amino 
acid sequence in a set; 

(b) comparing the distance between said query 
sequence comparison signature and the sequence comparison 
signatures for other amino acid sequences in said set, 
wherein said sequence comparison signatures for other amino 
acid sequences in said set are clustered into 

pharmacof amilies ; and 

(c) identifying a proximal cluster having one or 
more sequence comparison signature that has a closer 
distance to said query sequence comparison signature than 
the sequence comparison signatures of a distal cluster, 
thereby identifying the sequences having said query 
sequence comparison signature as being a member of the 
pharmacofamily for the proximal cluster, wherein the 
pharmacofamilies for the proximal and distal clusters 
belong to the same ligand binding family. 

38. (Original) The method of claim 37, wherein 
said pairwise comparison score is determined by an 
algorithm selected from the group consisting of Smith- 
Waterman, BLAST, FASTA, Needleman-Wunsch, Seller or PSI- 
BLAST. 

39. (Original) The method of claim 37, wherein 
said distance comprises a distance selected from the group 
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consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

40. (Original) The method of claim 37, wherein 
said distance comprises the distance between a sequence 
comparison signature and a set of sequence comparison 
signatures. 

41. (Original) The method of claim 40, wherein 
said distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

42. (Original) The method of claim 37, wherein 
said cluster of sequence comparison signatures is 
identified by hierarchical clustering. 

43. (Original) The method of claim 42, wherein 
said hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 

44. (Original) The method of claim 42, wherein 
said cluster of sequence comparison signatures is 
identified by non-hierarchical clustering. 



45. (Original) The method of claim 44, wherein 
said non-hierarchical clustering comprises Jarvis-Patrick 
clustering. 
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46. (Original) The method of claim 37, wherein 
said cluster of sequence comparison signatures is 
identified by cell -based clustering. 

47. (Original) The method of claim 37, wherein 
said ligand comprises a nicotinamide adenine dinucleotide- 
related molecule. 

48. (Original) The method of claim 47, wherein 
said nicotinamide adenine dinucleotide-related molecule is 
selected from the group consisting of oxidized nicotinamide 
adenine dinucleotide, reduced nicotinamide adenine 
dinucleotide, oxidized nicotinamide adenine dinucleotide 
phosphate, reduced nicotinamide adenine dinucleotide 
phosphate, and a mimetic thereof. 

49. (Original) A method for constructing a 
conformer model, comprising: 

(a) determining a sequence comparison signature 
for each amino acid sequence in a set of amino acid 
sequences, wherein said sequence comparison signature 
comprises pairwise comparison scores for said amino acid 
sequence compared to each of the other amino acid sequences 
in said set; 

(b) constructing a distance arrangement 
comprising said sequence comparison signatures related 
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according to the distance between each of said sequence 
comparison signatures ; 

(c) identifying separate clusters of sequence 
comparison signatures in said distance arrangement, wherein 
said separate clusters include sequence comparison 
signatures for amino acid sequences in the same ligand 
binding family and separate pharmacof amilies; 

(d) determining bound conformations of said 
ligand bound to the members of a pharmacof ami ly; and 

(e) constructing an average structure of said 
bound conformations, wherein said average structure is a 
conformer model of said ligand. 

50. (Original) The method of claim 49, wherein 
said pairwise comparison score is determined by an 
algorithm selected from the group consisting of Smith- 
Waterman, BLAST, FASTA, Needleman-Wunsch, Seller or PSI- 
BLAST. 

51. (Original) The method of claim 49, wherein 
said distance comprises a distance selected from the group 
consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

52. (Original) The method of claim 49, wherein 
said distance comprises the distance between a sequence 
comparison signature and a set of sequence comparison 
signatures . 
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53. (Original) The method of claim 52, wherein 
said distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

54. (Original) The method of claim 49, wherein 
said cluster of sequence comparison signatures is 
identified by hierarchical clustering. 

55. (Original) The method of claim 54, wherein 
said hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 

56. (Original) The method of claim 49, wherein 
said cluster of sequence comparison signatures is 
identified by non-hierarchical clustering. 

57. (Original) The method of claim 56, wherein 
said non-hierarchical clustering comprises Jarvis-Patrick 
clustering. 

58. (Original) The method of claim 49, wherein 
said cluster of sequence comparison signatures is 
identified by cell-based clustering. 
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59. (Original) The method of claim 49, wherein 
said ligand comprises a nicotinamide adenine dinucleotide- 
related molecule. 

60. (Original) The method of claim 59, wherein 
said nicotinamide adenine dinucleotide-related molecule is 
selected from the group consisting of oxidized nicotinamide 
adenine dinucleotide, reduced nicotinamide adenine 
dinucleotide, oxidized nicotinamide adenine dinucleotide 
phosphate, reduced nicotinamide adenine dinucleotide 
phosphate, and a mimetic thereof. 

61. (Original) A method for constructing a 
pharmacophore model, comprising: 

(a) determining a sequence comparison signature 
for each amino acid sequence in a set of amino acid 
sequences, wherein said sequence comparison signature 
comprises pairwise comparison scores for said amino acid 
sequence compared to each of the other amino acid sequences 
in said set; 

(b) constructing a distance arrangement 
comprising said sequence comparison signatures related 
according to the distance between each of said sequence 
comparison signatures; 

(c) identifying separate clusters of sequence 
comparison signatures in said distance arrangement, wherein 
said separate clusters comprise sequence comparison 
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signatures for amino acid sequences in the same ligand 
binding family and separate pharmacof amilies; 

(d) comparing the bound conformations of said 
ligand bound to members of one of said pharmacof amilies; 

(e) identifying one or more conformation- 
dependent properties of said ligand bound to members of one 
of said pharmacof amilies; and 

(f) constructing a pharmacophore model that 
contains said one or more conformation-dependent 
properties. 

62. (Original) The method of claim 61, wherein 
said pairwise comparison score is determined by an 
algorithm selected from the group consisting of Smith- 
Waterman, BLAST, FASTA, Needleman-Wunsch, Seller or PSI- 
BLAST. 

63. (Original) "The method of claim 61, wherein 
said distance comprises a distance selected from the group 
consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

64. (Original) The method of claim 61, wherein 
said distance comprises the distance between a sequence 
comparison signature and a set of sequence comparison 
signatures . 
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65. (Original) The method of claim 64, wherein 
said distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

66. (Original) The method of claim 61, wherein 
said cluster of sequence comparison signatures is 
identified by hierarchical clustering. 

67. (Original) The method of claim 66, wherein 
said hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 

68. (Original) The method of claim 61, wherein 
said cluster of sequence comparison signatures is 
identified by non-hierarchical clustering. 

69. (Original) The method of claim 68, wherein 
said non-hierarchical clustering comprises Jarvis- Patrick 
clustering. 

70. (Original) The method of claim 61, wherein 
said cluster of sequence comparison signatures is 
identified by cell-based clustering. 

71. (Original) The method of claim 61, wherein 
said ligand comprises a nicotinamide adenine dinucleotide- 
related molecule. 
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72. (Original) The method of claim 71, wherein 
said nicotinamide adenine dinucleotide-related molecule is 
selected from the group consisting of oxidized nicotinamide 
adenine dinucleotide, reduced nicotinamide adenine 
dinucleotide, oxidized nicotinamide adenine dinucleotide 
phosphate, reduced nicotinamide adenine dinucleotide 
phosphate, and a mimetic thereof. 

73. (Original) The method of claim 72, wherein 
said conformation- dependent property comprises a 
spectroscopic signal . 

74. (Original) The method of claim 72, wherein 
said conformation-dependent property comprises an NMR 
signal . 

75. (Original) The method of claim 74, wherein 
said NMR signal is selected from the group consisting of 
chemical shift, J coupling, dipolar coupling, cross- 
correlation, nuclear spin relaxation, transferred nuclear 
Overhauser effect, and any combination thereof. 

76. (Original) A method for predicting the bound 
conformation of a ligand bound to polypeptide, comprising: 

(a) determining a query sequence comparison 
signature for an amino acid sequence, wherein said query 
sequence comparison signature comprises pairwise comparison 
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scores for said amino acid sequence compared to each amino 
acid sequence in a set; 

(b) comparing the distance between said query- 
sequence comparison signature and the sequence comparison 
signatures for other amino acid sequences in said set, 
wherein said sequence comparison signatures for other amino 
acid sequences in said set are clustered into 

pharmacof amilies ; 

(c) identifying a proximal cluster having one or 
more sequence comparison signature that has a closer 
distance to said query sequence comparison signature than 
the sequence comparison signatures of a distal cluster, 
thereby identifying the sequences having said query 
sequence comparison signature as being a member of the 
pharmacof amily for the proximal cluster, wherein the 
pharmacof amilies for the proximal and distal clusters 
belong to the same ligand binding family; and 

(d) obtaining a pharmacophore model of said 
ligand bound to said pharmacof amily for the proximal 
cluster, wherein said pharmacophore model comprises a 
prediction of the bound conformation for said ligand bound 
to the amino acid sequence having said query sequence 
comparison signature. 

77. (Original) The method of claim 76, wherein 
said pairwise comparison score is determined by an 
algorithm selected from the group consisting of Smith- 
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Waterman, BLAST, FASTA, Needleman-Wunsch, Seller or PSI- 
BLAST. 

78. (Original) The method of claim 76, wherein 
said distance comprises a distance selected from the group 
consisting of a Euclidian distance, exclusive OR distance 
and Tanimoto coefficient. 

79. (Original) The method of claim 76, wherein 
said distance comprises the distance between a sequence 
comparison signature and a set of sequence comparison 
signatures . 

80. (Original) The method of claim 79, wherein 
said distance comprises a distance selected from the group 
consisting of a Penrose distance and Mahalanobis distance. 

81. (Original) The method of claim 76, wherein 
said cluster of sequence comparison signatures is 
identified by hierarchical clustering. 

82. (Original) The method of claim 81, wherein 
said hierarchical clustering is selected from the group 
consisting of agglomerative clustering and divisive 
clustering. 
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83. (Original) The method of claim 76, wherein 
said cluster of sequence comparison signatures is 
identified by non-hierarchical clustering. 

84. (Original) The method of claim 83, wherein 
said non-hierarchical clustering comprises Jarvis -Patrick 
clustering. 



85. (Original) The method of claim 76, wherein 
said cluster of sequence comparison signatures is 
identified by cell-based clustering. 

86. (Original) The method of claim 76, wherein 
said ligand comprises a nicotinamide adenine dinucleotide - 
related molecule. 



87. (Original) The method of claim 86, wherein 
said nicotinamide adenine dinucleotide-related molecule is 
selected from the group consisting of oxidized nicotinamide 
adenine dinucleotide , reduced nicotinamide adenine 
dinucleotide, oxidized nicotinamide adenine dinucleotide 
phosphate, reduced nicotinamide adenine dinucleotide 
phosphate, and a mimetic thereof. 



